Overview

Dataset Statistics

Number of Variables 10
Number of Rows 94263
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 6
Duplicate Rows (%) 0.0%
Total Size in Memory 39.9 MB
Average Row Size in Memory 443.7 B
Variable Types
  • Categorical: 6
  • DateTime: 1
  • Numerical: 3

Dataset Insights

age_entry(years) is skewed Skewed
animal_id has a high cardinality: 79919 distinct values High Cardinality
breed has a high cardinality: 2646 distinct values High Cardinality
color has a high cardinality: 402 distinct values High Cardinality
animal_id has constant length 7 Constant Length

Variables


animal_id

categorical

Approximate Distinct Count 79919
Approximate Unique (%) 84.8%
Missing 0
Missing (%) 0.0%
Memory Size 6786936
  • The largest value (A721033) is over 2.36 times larger than the second largest value (A718223)

Length

Mean 7
Standard Deviation 0
Median 7
Minimum 7
Maximum 7

Sample

1st row A786884
2nd row A706918
3rd row A724273
4th row A682524
5th row A743852

Letter

Count 94263
Lowercase Letter 0
Space Separator 0
Uppercase Letter 94263
Dash Punctuation 0
Decimal Number 565578
  • animal_id contains many words: 79919 words
  • The largest value (a721033) is over 2.36 times larger than the second largest value (a718223)
  • animal_id has words of constant length

date

datetime

Distinct Count 4183.7459
Approximate Unique (%) 4.4%
Missing 0
Missing (%) 0.0%
Memory Size 1508208
Minimum 2013-10-01 00:00:00
Maximum 2025-04-09 00:00:00

age_entry(years)

numerical

Approximate Distinct Count 39
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1508208
Mean 2.4778
Minimum 0
Maximum 24
Zeros 880
Zeros (%) 0.9%
Negatives 0
Negatives (%) 0.0%
  • age_entry(years) is skewed right (γ1 = 1.9888)

Quantile Statistics

Minimum 0
5-th Percentile 0.08
Q1 0.5
Median 1
Q3 3
95-th Percentile 9
Maximum 24
Range 24
IQR 2.5

Descriptive Statistics

Mean 2.4778
Standard Deviation 2.9645
Variance 8.788
Sum 233560.49
Skewness 1.9888
Kurtosis 4.0926
Coefficient of Variation 1.1964
  • age_entry(years) is not normally distributed (p-value 8.700877903793048e-13)
  • age_entry(years) has 10059 outliers

intake_type

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 6881490
  • The largest value (Stray) is over 3.04 times larger than the second largest value (Owner Surrender)

Length

Mean 8.0031
Standard Deviation 4.4036
Median 5
Minimum 5
Maximum 18

Sample

1st row Stray
2nd row Stray
3rd row Stray
4th row Stray
5th row Owner Surrender

Letter

Count 724901
Lowercase Letter 601144
Space Separator 29494
Uppercase Letter 123757
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Stray, Owner Surrender) take over 50.0%
  • The largest value (stray) is over 3.04 times larger than the second largest value (owner)

intake_condition

categorical

Approximate Distinct Count 18
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 6694100
  • The largest value (Normal) is over 16.88 times larger than the second largest value (Injured)

Length

Mean 6.0151
Standard Deviation 0.4903
Median 6
Minimum 4
Maximum 10

Sample

1st row Normal
2nd row Normal
3rd row Normal
4th row Normal
5th row Normal

Letter

Count 566941
Lowercase Letter 472614
Space Separator 64
Uppercase Letter 94327
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Normal, Injured) take over 50.0%
  • The largest value (normal) is over 16.88 times larger than the second largest value (injured)

sex_upon_intake

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 7282604

Length

Mean 12.2584
Standard Deviation 1.0606
Median 13
Minimum 4
Maximum 13

Sample

1st row Neutered Male
2nd row Spayed Female
3rd row Intact Male
4th row Neutered Male
5th row Neutered Male

Letter

Count 1061995
Lowercase Letter 874212
Space Separator 93514
Uppercase Letter 187783
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Intact Male, Intact Female) take over 50.0%

breed

categorical

Approximate Distinct Count 2646
Approximate Unique (%) 2.8%
Missing 0
Missing (%) 0.0%
Memory Size 7924244

Length

Mean 19.0653
Standard Deviation 7.0889
Median 19
Minimum 3
Maximum 54

Sample

1st row Beagle Mix
2nd row English Springer S...
3rd row Basenji Mix
4th row Doberman Pinsch/Au...
5th row Labrador Retriever...

Letter

Count 1618329
Lowercase Letter 1345369
Space Separator 163213
Uppercase Letter 272960
Dash Punctuation 0
Decimal Number 0
  • breed contains many words: 1847 words
  • The largest value (mix) is over 3.78 times larger than the second largest value (bull)

color

categorical

Approximate Distinct Count 402
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 7010293
  • The largest value (Black/White) is over 2.08 times larger than the second largest value (Brown/White)

Length

Mean 9.3695
Standard Deviation 3.691
Median 10
Minimum 3
Maximum 27

Sample

1st row Tricolor
2nd row White/Liver
3rd row Sable/White
4th row Tan/Gray
5th row Chocolate

Letter

Count 811287
Lowercase Letter 645113
Space Separator 8567
Uppercase Letter 166174
Dash Punctuation 0
Decimal Number 0

year

numerical

Approximate Distinct Count 13
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1508208
Mean 2018.1852
Minimum 2013
Maximum 2025
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • year is skewed right (γ1 = 0.3334)

Quantile Statistics

Minimum 2013
5-th Percentile 2014
Q1 2015
Median 2018
Q3 2021
95-th Percentile 2024
Maximum 2025
Range 12
IQR 6

Descriptive Statistics

Mean 2018.1852
Standard Deviation 3.1843
Variance 10.1395
Sum 1.9024e+08
Skewness 0.3334
Kurtosis -0.9097
Coefficient of Variation 0.001578
  • year is not normally distributed (p-value 6.97993634893993e-05)

month

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1508208
Mean 6.4849
Minimum 1
Maximum 12
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • month is skewed right (γ1 = 0.0049)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 3
Median 6
Q3 10
95-th Percentile 12
Maximum 12
Range 11
IQR 7

Descriptive Statistics

Mean 6.4849
Standard Deviation 3.491
Variance 12.1868
Sum 611283
Skewness 0.004931
Kurtosis -1.2428
Coefficient of Variation 0.5383
  • month is not normally distributed (p-value 0.003456942825175045)

Interactions

Correlations

Missing Values